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SL2 MIFRKFLNFLKPYKMR 16 

aaaatatcgtcaggaaataataacatttcagatataccctgaactctacagtttATGATATTCAGGMTTTCTGAATTTTCTGMCCTTACAi^ 1394 

T D P I I F V I G C T G T G K S D L G V A I A K K Y G G E V I S V 49 

GAACGGATCCGATTATTTTCGTGATTG(3GTGCACTGGAACCGGGAAMGTGATCTTGGAGTGGCAATTGCM 1494 

T SHP109 

D S M Q F Y K G LDIATNRIT 66 

AGATTCAATGCMTTTTATAMGgtacatgggttttgtttcaattttaaattaattaattttcgtttttcagGACTTGACATT(X:CACGAATAAGATAAC 1594 



EEESEGIQHHMMSFLNPSESSSYNVHSFREVTL 99 

GGAAGMGAATCTGAAGOTCAACATCATATGATGTCATTTTTGAATCCATCTGMTOT 1694 

SHP94 T 

D L I K K I R A R S R I P V I V G 116 

GATCTTATTAMqtqcttaattcqccactttttqaacttqa 1794 

T SHP95 

GTTYYAESVLYENNLIETNTSDDVDSRSRTSSE 149 

GAGGAACCACTTATTATGCTGAAAGTGTCCTTTATGAGAATAATCTGATTGAAACCAACA CTTCAGATGACGTGGATTCCA AATCGAGAACATCATCAGA 1894 

SHP96 T 



SSSEDTEEGISNQELWDELKKIDEKSALLLHPN 182 
ATCGTCATCTGMGACACTGMGAAGGAATTO^ 1994 



ffo-1 continued... ^32 

NRYRVQRALQIFRETG 198 
AATCGmTCGACTACAGAGAGCATTGCAMTTTTCAGAGAMCTGgtaattgatttgcaaatttccagattaaaaacaaatcaagtaaagttttttgca 2094 



IRKSELVEKQKSDETVDLGGRLRFDNSLVIFMD 231 

gGMTCCGAAAMGTGAACTTGTTGAAAMCAGA^^ 2194 
SHP97 T 

ATPEVLEERLDGRVDKMIKLGLKNELIEFYNE 263 

ATGCAACACCTGAAGTHTAGMGAM 2294 

aaatatttgaatttttccagaaaaaaaaagaaaattttttattattttgtttttttttcattctttactattttccaaaaaagtttaaacttttgaaaac 2394 

H A E Y 267 

tgttcagaaaatgttcgtgtatttattttagcttactgaggcattatttcattgtgatttttactatactctataaactaaattttcagCACGCCGAGTA 2494 

INHSKYGVMQCIGLKEFVPBLNLDPSERDTLNG 300 

CATAMTCACAGCAAATATGGTGTCATGCAATGTATTGGTCTTAAAGAATTCGTTCCATGGCTCAATTTGGACCCATCAG AAAGAGATACACTCAATGGG 2594 

^-CG^oOiesion 8HP98 * 

D K L F K Q G CDDVKLHTRQY 318 

GATAAATTGTTCAAGCAAGGgtaatttaaatttattttcaatttttataaattccaagctattttcagATGCGATGATGTGAAGCTTCACACTCGACAAT 2694 



p-l continued,,, 5/32 

ARRQRRWYRSRLLKRSDGDR 33 

ATGCACGGCGCCAGAGACGGTGGTATCGATCGAGACTTTTAAAACGGTCGGATGGTGATCGGgtatgttgattttaaaaaaattgaatttttaaagaact 219 

tttttactaaattaacaaagttattggctgaaaatggctgaaaattatagtaaaactaatcaaaaaaattgaaattttgaattaaagtcataaagtgacg 289 

KMASTKMLD 34 

accagaaaattaaaaaaaaacatttttctattttaattaattcactctacttcactttaaaaataattttcagAAAATGGCAAGTACAAAAATGCTGGAT 299 



TSDKYRIISDGMDIVDQWMNGIDLFED 37 

ACATCTGACMGTACCGMTMTTAGTGATGGMTGGACATTGnGATCAATGGATGAATGGMTCGATCTATTTGAAGATgtaaaatttcacaaattct 309 

ISTDTNPILKGSDANILLNCEI 39 

aaaatttccgaatcacaaattaaaatttctacagATCTCCACAGACACCAATCCAATTCTAAAAGGGTCCGATGCAAATATTCTGCTGAATTGTGAAATC 319 

CNISHTGKDNW QKEIDGKK \\ 

TGTAATATTTCAATGACTfmMGATMTTGgtttgtttcaatacatattataatttcgaaatgaattttttcagGCAGAMCATATCGATC^ 329 
SHPIIOV ▼ SHP100 

HKHHAKQKKLAETRT* 43 

GCACMGCATCATGCTAAGCAAAAGAAATTGGCAGAGACTCGCACAtaagacgctatatttattttttgttaacttaaattatttttgttgttgattgtt 339 

polyA 

ctctaaataaaaaaacagctcagagagaaga^taggcgctcgtccacatctccgacgatagtcaacccgaacgaagggaactatctttaattgtcagtga 349 

* SHP92 
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tgatttttactatactctataaactaaattttcagCACGCCGAGTACATAAATCACAGCAAATATGGTGTCACG 1197 

HAEYINHSKYGVT 276 

TTGGTCTTMGMTTCGTTCCATGGCTCAATTTGGACCCATCAGAM 1272 

LVIiKNSFHGSIWTHQKHIHSMGINC 301 



TCAAGCAAGGgtaatttaaatttattttcaatttttataaattccaagctattttcagATGCGATGATGtgaagcttc 1350 
S 8 X D A K M • 308 
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Sequence of GRO-1 and homologues 



• it it i i hi ill 



C.elegans 1 MIFRRFLNFLKPYKMRTDPIIFVIGCTGTGKSDLGVAIAKKYGGEVISVDSMQFYKGLDIATNKITEEESEGIQ 
S.cerevisiae 1 MLKGPLKGCLNMSKKVIVIAGTTGVGKSQLSIQLAQKFNGEVINSDSMQVYKDIPIITNKHPLQEREGIP 
I.coli i MSDISKASLPKAIFLMGPTASGKTALAIELRKILPVELISVDSALIYKGMDIGTAKPNAEELLAAP 



ATP/GTP 
binding site 



II MM I 



C.elegans . 76 HMMSFLNPSESSSYNVHSFREVTLDLIKKIRARSKIPVIVGGTTYYAESVLYENNLIETNTSDDVDSKSRTSSE 
S.cerevisiae 12 HVHNHVDWSE--EYYSHRFETECMNAIEDIHRRGKIPIWGGTHYYLQTLFNKRVDTKSSERKLTRKQLDILES 
I.coli 68 RLLDIRDPSQ--AYSAADFRRDALAEMADITAAGRIPLLVGGTMLYFKALLEGLSPLPSADPEVRARIEQQAAE 



C.elegans isi SSEDTEEGISNQELWDELKKIDEKSALLLHPNNRYRVQRALQIFRETGIRKSELVEKQKSDETVDLGGRLRFDN 

S.cerevisiae M7 DPDV IYNTLVKCDPDIATKYHPNDYRRVQRMLEIYYKTGKKPSETFNEQK ITLKFD-' 

I.coli io GWES LHRQLQEVDPVAAARIHPNDPQRLSRALEVFFISGKTLTELTQTSG DALPYQV 
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• •••••• 

C.elegans 226 LVIFMDATPEVLEERLDGRVDKMIKLGLKNELIEFYNEHAEYINHSKYGVMQCIGLKEFVPWLNLDPSERDTLN 
S.cemisiae 205 LFLWLYSKPEPLFQRLDDRVDDMLER6ALQEIKQLYEYYSQNKFTPEQCENGVWQVIGFKEFLPWLTGKTDDNT 
I,eoli 202 QFAIAPASRELLHQRIEQRFHQHLASGFEAEVRALFARGDLHTDLPSIRCVGYRQMWSYLEGEISYDEMVYRGV 

••••Ml , 

C.elegans 301 DKLFRQGCDDVKLHTRQYARRQRRWYRSRLLKRSDGDRKMASTKMLDTSDKYRIISDGMDIVDQWMNGIDLFED 
S.cemisiae 280 KLEDCIERMKT — RTRQYAKRQVKWIKKMLI PDIKGDILLDATDLSQWDTNASQRAIA I SNDF ISNRPIKQERA 
l #colJ 211 ATRQLAKRQITWLRGWEGVHWLDSEKPEQARDEVLQVVGAIAG • 

, C2H2 zinc finger , . 

C.elegans w ^nTPTT.KG^AWTIXN CEICNISMTGKDNWOKHIDGKKH KHHARQKKLATRT 

S.cerevisiae 353 KALEELLSKGETTMKKLDDWTHYTRNVCRNADGKNVVAIGEKYWKIHLGSRRHKSNLKRNTRQADFEKWKINKK 



tl/32 




/2/32 



Sequence of HAP-1 and its homologies 



•Mill • 



h. sapiens MAASLVGKKIVFVT6NAKKLEEWQILGDKFP CTLVAQKIDLPEYXG-EPDEISIQKCQE 

c, elegans " mlyilwklnylqkkmslrkinfvtgkvkkleevkailknfe vsnvdvdldepqg-epefiaerkcre 

S, cerevisiae MSNNEIVPVTGNANKLKEVQSILTQEVDNNNKTIHLINEALDLEELQDTDLNAIALAKGKQ 

E. coli MQKWLATGNVGKVRELASLLSDFGLD IVAQTDLGVDSAEETGLTFIENAILKA 



H, sapiens AVRQV-QG-PVLVEDTCLCFNALGXLPGPYIKWFL— EKLKPEGLHQLLAGFED KSAYALCTFALSTGDP 

C, elegans AVEAV-KG-PVLVEDTSLCFNAMGGLPGPYIKWPL— KNLKPEGLHNMLAGFSD KTAYAQCIFAYTEG-L 

s. cerevisiae avaalgkgkpvfvedtalrfdefnglpgayikwfl-ksmglekivkmlepfen knaeavtticfadsrg 

i coii rhaakvtalpaiaddsglavdvlggapgiysarysgedatdqknlqklletmkdvpddqrqarfhcvlvylrhae 



III I 1 1 I I II I 1 1 1 I 

H, sapiens sqpvrlfrgrtsgriv-aprgcqdfgwdpcfqp-dgyeqtyaempkaeknavshrfrallelqeyfgslaa 

c, elegans gkpihvfagkcpgqiv-aprgdtafgwdpcfqp-dgfketfgehdkdvkneishrakalellkeyfqrn 

s, cerevisiae e--yhffqgitrgkiv-psrgpttfgwdsifepfdshgltyaemskdaknaishrgkafaqfkeylyqndf 

b, coli dptplvchgswpgvitrepagtggfgydpiffv-psegrtaaeltreersaisergqalkllldalrng 
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mRNA sequence of human homologue of gro-1: hgro-1 



\^ X \j\^\^r\ X r\r\\D 




TGGPGGCTGr 

X OO^vJu\^ X 


APGAGCAGTT 


\_^» X V_j X v_J \_j xi 


P T CCCC T P A P 




PPPAPPPT AP 


PTPTTPTAPT 

\^ X X X \J X xi VJJ X 


PAT TPTPPPP 


p p p a rr p pp a 




P APPPTPPPP 


TTPP A PPT AP 


PPP APPPPPT 


L^- O v3 ^ VJJ <CJ 1 Oi-iVj 




PTPAPTPPAT 

X \JJxi v _L \^xi X 


PPAPPTPTAT 


PAAPPPPTAP 


APATPATPAP 




TPTPPPP AAP 


APP APAPA AT 

xi VJj V^xiVJJ xiVJJXixi X 


PTPPP PPP AP 


PAPATPATPA 


PPTTTPTPPA 


TPPTPTTPTP 


APPAATTAPA 


PAPTPPTGPA 


PTTPAPAAAT 


apapp a aptp 


PTPTPATTPA 

\_* X X V_J xi X X V_J ii 


APATATATTT 

nun x xv x xi xxx 


PPPPP APAPA 

VJj \_y VJ Xi V_J Xi Xi 


AAATTPPTAT 


tpttptpppa 

X O X X yJ X V3\jKj£-\ 


PPA APPA ATT 


ATTAP A TTPA 

xi. x x ni .rt. x x un 


ATPTPTPPTP 

xi. X X ^ X X 


TPPAAAPTTP 


TT^TPAATAP 


P A APP PPP AP 


PAPATPPPPA 


P TP APA A APT 

X \JJxi \_JxiXiXi V_J X 


PATTPAPPPA 


A A APTPPAPP 


TTPA A A APPA 


PPATPPTPTT 
vjjxi. x x ^ x x 


PT APTTP AP A 




PP APPTPPAP 


PP APA A ATPP 


PTPPP A APPT 


PP ATPP AP AT 


pzipaziApppzx 


AAPTPPPPAP 


P APPTTPP A A 


PTTTTTPAAP 

\J X X X X X \Dr\r\\J 


A AAP APPA AT 


PTPTPATAPT 


PAATTTPTPP 


ATPPTP A AP A 


T APPPA APA A 


PPTPPTPPTP 




TPPTPTPAAP 


TTPTPTAAPP 


PTTPP AT PPT 
\^ X X u^ri X X 


TTPPPTTPAT 
X 1 uuL- 1 X \^r\ X 




PAPT'TPTAPA 


TP A PP PPT TP 


PAT A AP APPP 


TPPATPAP AT 


PPTTPPTPPT 


PPPPTPTTPP 


APPA APT AAP 


APATTTTPAP 

iiVjii X X X X 


APAPPPTATA 

nun\> x xi x xi 


ATCAGAAGAA 


TGTTTCGGAA 


AATAGCCAGG 


ACTATCAACA 


TGGTATCTTC 


CAATCAATTG 


GCTTCAAGGA 


ATTTCACGAG 


TACCTGATCA 


CTGAGGGAAA 


ATGCACACTG 


GAGACTAGTA 


ACCAGCTTCT 


AAAGAAAGGA 


CCTGGTCCCA 


TTGTCCCCCC 


TGTCTATGGC 


TTAGAGGTAT 


CTGATGTCTC 


GAAGTGGGAG 


GAGTCTGTTC 


TTGAACCTGC 


TCTTGAAATC 


GTGCAAAGTT 


TCATCCAGGG 


CCACAAGCCT 


ACAGCCACTC 


CAATAAAGAT 


GCCATACAAT 


GAAGCTGAGA 


ACAAGAGAAG 


TTATCACCTG 


TGTGACCTCT 


GTGATCGAAT 


CATCATTGGG 


GATCGCGAAT 


GGGCAGCGCA 


CATAAAATCC 


AAATCCCACT 


TGAACCAACT 


GAAGAAAAGA 


AGAAGATTGG 


ACTCAGATGC 


TGTCAACACC 


AT AG AAAG T C 


AGAGTGTTTC 


CCCAGACTAT 


AACAAAGAAC 


CTAAAGGGAA 


GGGATCCCCA 



GGGCAGAATG ATCAAGAGCT GAAATGCAGC GTTTAA GAGA CATGTCCAGT- 

GGCCTTTGGA AAGGTGGTGG GGATCCAGTT CAGGAGGGAG GGGTATGTTT 

GTCTCCCAGT CTGGGCAAAG GAGTGCTATG CGGAATTCTC TGCATAGCAG 

AAAAGCTCCC ACCATTTTCT TTTGATGTGG TTTTAAAGTC TCACGTTCTC 

TATAATAGAA ACAGCAGGTC TTGTCAGCTC CTTGTGTGGC TGATGTGTCT 

GGAAATGATG TAGTTCAGGA AAGCATTTTT TTTTTCTTTG AACCTTAAAG 

GTTCTATTAT TAAAAGCAGC ACAGATTCCA CATTTTTATA CATGAGGATC 

TTCTTTGTGG TGAATACCAG GATTGACTGC ATCCCTTTAA AAGAAGTTTT 

ATGTCCCTGA CTCTGGCTAA AATTATCTAA TTTCCAGATG CTTTTGTAGA 

TGACTGAAGT ATTTGTGAGC CACATATTGG GAGTTCTAGA TTTGAGTGAA 

TGGCAGGAAA GGGCCATCTC CATTGAGATG ATTAAGTGAA CCAAACTAGT 

TCTCGGAATT CTACAGAGAA GGAGGGAATC AGACTGAGGA AGCTGTGACA 

TAGGACTTGA AGACCAAAGA CTTTGAAATT TGCGAGCTGC TCATGTGTGA 

GTTATTATCA CTGCTGTCTT TCTATTGAGT TACAAATCTA TATTTTTATT 

GAAGTTTAAA TAAAGAAAAA ATTTACAAGA AAAAAAAAAA A 



f 
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GRQ-1 and its human homoloaue hgro-1 p 



Ml I MMM I Ml I MM • MM M 

hgro-lp . WTOM^ 

GRD-1 pramm 



I | Ml M I It M I I I II Ml MM II M I 

GBD-l IQTIMGIQHK 



hgro-lp 
GRQ-1 



I • I I I II I I M I MM 



I I I 



TRPQMSTEMDRKVELEKEDGUf WS^^l^MmmSSimM 

OTSDDVDSKSRTSSI^ 



• 
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% % || III I I I M MM Ml I M M I 

hgro-lp SIMQHTiM 



I • III Ml • • • 

hgro-lp SMSQDYQHGIFQSIGFKEFHEYLITEGIKCTLETSNQLLKK^ 

(D-l YINHSKY--GVMQCIGLKEFVPWLTO 



hgro-lp VSDVSMSVi™ — - 

GD-1 RSDGDRKM/^M 



II I I I • I • I 



hgro-lp CDmillGDRMIKSKSMOLKKM 

C2H2 zinc finger 

i-==r - PFi 
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W ffi J 
> H ^ 

.•SI w 



0 cx 

ffi £ J 
co <J 
Q < 

^ on 
J H 

°> 

^ ^ r 
fe S * 

o 5 w 

o < < 

H Oh 
^ ^ < 
5 >4 



to 

Ph 



o 

co 



O 
H 
co 

CX 
> 



Oh 



a 
> * 



co 
Q 
> 



Ph 



Ph O 

> H 

CO Q 

O h 

CO CO 

^ > 

cx C 



Q 

Ph 

< 

Ph 
CO 

C/ 
W 
W 



CO 



< 

CO 

O 



H 

o 



^ _j H 

^ Ph < 

a > w 

w * < 

0 Ph H 

H ~ < 

Ph S Q 



o cx 

> J 

t-H W 



Ph 
CO 

H 
Q 



H 

CO 



^ < 
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Structure of 




Sad 
gtacgtglgagctc]' 



SHP161 



atcgtgttccaggtgiactatatattgagcaggaggacgagttgtttgtttcatgctgcttaaaaataaaaatg 

. J mi > 

cagcgagc[gca- / i 
SphI 

gaaaattgagtcaaaaagttgagataaaacaaattaaaacaattttctgaaaaataaacaactgaaatttgaagtaataaacaacacgcgaaaacgttat 



ttcggagcatcgtttgagaagtaaaactttttttcggcgcacccttgtgcgcagtttttatcttctcttttaatttaattttcaagctaaatctttcttt 



promoter 



ttaaactttq 



gro-1 



SHP159 M I. F R K.F .L N F L K P Y K M R 
► 

yataaatatttaaatattcagbtataccctgaactctacagttt ATGATATTCAGGMTTTCTGMTTTTCTGMCCmCMTGC 
SHP160 



T D P I I F V 1 G C T G T G K S D L G V A I A K K Y G G E V I S V 
GAACGGATCCGATTATTTTCGTGATTGGGTGCACTGGMGGGAAAAGTGATCTTGGAGTGGCAATTGCAAAGAAATATGGAGGAGAGGTGATTAGTGT 
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D S M Q F y K G L D I A T N . 

AGATTCAATGCAATTTTATAAAGgtacatgggttttgtttcaattttaaattaattaattttcgtttttcagGACTTGACATTGCCACGAAT. 



. ; . CATGCTAAGCAAAAGAAATTGGCAGAGACTCGCACA 

r 



sHpno 



AKQKKLAETRT* 

taagacgctatatttattttttgttaacttaaattatttttgttgttgattgtt 

— [tctagajtatact 
Xbal 

ctctaaataaaaaaacagctcagagagaagattaggcgctcgtccacatctccgacgatagtcaacccgaacgaagggaactatctttaattgtcagtga 

SHP162 I 

^■[ctqcagltgtcat 

PstI 
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Construction of pMQ1 8 



SHP151 



SHP170 



" aaa /v \ /v a 7 SHP1515HP170 

cz(Hb=i^\ = /hr \ = /Ub PCR product amplified 

promoter gra-/ from pMQ8 

Sphl Xbal 



1 kb 



pPD95.77 



gfp 3' UTR 



-L 



gro-1 gfp 200 aa 



I 



GRO-1 ::GFP Fusion Protein 



20/ 3 2 



atcgtgttccaggtgcaactatatattgagcaggaggacgagttgtttgtttcatgctgcttaaaaataaaaatggaaaattgagtcaaaaagttgagat -9557 

aaaacaaattaaaacaattttctgaaaaataaacaactgaaatttgaagtaataaacaacacgcgaaaacgttatttcggagcatcgtttgagaagtaaa -9457 

actttttttcggcgcacccttgtgcgcagtttttatcttctcttttaatttaattttcaagctaaatctttctttttaaactttgaataaatatttaaat -9357 

MFRKLGSSGSLWKPKNPHSLE 21 

attcagaatgcaccaataaacctggaacaaaatcgata ATGTTCCGCMGCTTGGTTCT TCTGGGTCACTATGGAAGCCGAAAAATCCGCATTCTTTGGA -9257 

SHP190 T 

Y L K Y L Q G V L T K N E K V T E N N K K I L V E A L R A I A E I 54 

ATACCTCAMTAnTACAAGGAGTG^ -9157 

LIffGDQNDASVFD F F L E R 72 

CTCATTTGGGGCGATCAGAATGATGCTTCGGTTTTTGAgtgagtttttttccaatgttttttttcaaatctgatgttgaatttcagTTTCTTCCTTGAGC -9057 

QMLLYFLKIMEQGNTPLNVQLLQTLNILFENIR 105 

GGCAAftTGCTTCTTTATTTCTTGAAAATTATGGMCMGGAMCACA CCACTAMTGTACAATTACT GCAGACTTTGAACATTTTATTCGA -8957 

---- - ▼ SHP171 

H E T S L Y FLLSNNHVNSII 123 

ACATGAMCTTCACTTTgtaagttttttatatggattttcgcttaaaattgccagttttcagATTTCCTTCTAAGTAACAATCATGTAAACTCGATTATT -8857 



SHKFDLQNDEIMAYYISFLKTLSFKLNPATIHFF 157 
TCCCACAMTTCGATTTACAAAATGATGAGATCATGGCnACTACATTAGTTTTCTGAAAACTCTTTCATTTAAACTGAATCCAGCTACAATCCACTTCT -8757 



gop-l continued,.. 2,/32 

F H E T T E E F P L L V E V L K L Y N W N E S H V R I A V R N I L 190 

TCTTCMTGMGACTGMGM ^ -8657 

SHP172 T 

L N I V R V Q D D S H I I F A I K H T K 210 

TTTAAATATTGTGAGAGTTCAAGATGATTCAATGATTATTTTCGCTATCAAGCATACAAAAgttagtagaaaattattttgaaaaggtgtatttaagcaa -8557 

E Y L S E L I D S L V G L S L E H D T F V R S A E N V L A H 240 

taaatattacagGAATATCTATCGGAGTTAATAGATTCTCTAGTTGGTCTCTCACTTGAAATGGACACATTTGTACGATCTGCTGAGAATGTGTTAGCTA -8457 



R E R L R G K V D D L I D L I H Y I G E L L D V E A V A E S L S I 273 

ATCGAGAGAGATTACGA GGAAMGTGGATGAnTMT TGATTTGATTCATTATATTGGT GAACTATTGGATGTGGAAGCTGTCGCCGAAAGTTTATCAAT -8357 
SHP142 m\ T 

L v T T R Y L S P L L L S S I S P R 291 

TTTAGgtcagttttactgctggaaaatcaagtttttaatgttaaattttcagTAACAACACGATACTTAAGCCCTCTATTACTTTCAAGTATATCACCAA -8257 



RDNHSLLLTPISALFFFSEFLi 313 

GAAGAGATMTCATTCACTTCTACTCACTO^ -8157 

IVRHHETIYTFLSSFLFDTQNTLTTHWI 341 

aat t ta 1 1 1 a 1 1 1 1 1 cag ATAGTTCGTCACCATGAAACAATATATAC ATTTTTATC ATCTTTCCTATTTGACACTCAGAATACTTTGACGACCCATTGGA -8057 

RHNEKYCLEPITLSSPTGEYVNEDH 366 

TACGTCATAATGAGAAATATTGCTTAGAACCGATTACATTATCATCACCAACCGGAGAATATGTGAATGAAGACCAgtaagagctgaaattttaaaattt -7957 



VFFDFLLEAFDSSQADDSKAFYGLM 391 
ttgctttgaatatagtattttcagCGTATTTTTCGATTTTCTACTGGAAGCATTTGATTCCAGTCMGCAGACGATTCGAAGGCATTCTATGGATTAATG -7857 



pp-l continued... 
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LIYSMFQNNA 401 
CTGATTTATTCAATGTTTCAGMTAATGgtgagttttaaaaaattgatttgttaaattaaaatttccatttccaataactcctcttcagacagtaagttt -7757 

tcaatgttgtaaagttcctgttcatctgtgatcgttttcttcatttttttagttttqcatgaacagttttcaaatttttttgatatcatacagtaaatat-7657 

cgtcatccagataattttctatttaaaaaaaatgaataaaaagagggcgcgcagaaattgccgaagtaatgtaaatttaaagggacacatgcgtagcttg -7557 

ttgtgtgggtctcgccgcgctttgtttgatttatcttgttttctgctcaaagagctgtttttattttagcgttgaatgcttttttaccgttctcatcggc -7457 

tttttaataggaatatttaaaaaaaaaggtttaataaatcttcgtttttacaaaatccatctaagatttgcatttgtgaagctcaacaagtaaagtttta -7357 

agtaacattgttttttaaaaaacaattgaaccaaattttgccgaaacattaataacatgacgatactctataaaatattcctcttttcaaaataaatttt -7257 

DVGELLSAANFPVLKESTTTSLAQQN 427 
caaaaaaaatccatttttcagCCGATGTTGGAGMCTTCTATCTGCTGCCAACTT CCCAGTGCTCAAAGMTCAACG ACMCTTCATTAGCTCAACRGAA -7157 

T SHP174 

L A R L R I A S T S S I S K R T R A I T E I G V E A T E E D E I F 480 

TCTTGCTCGTCTCCGAATAGCATCTACGTCTTCCATATCAAAGCGMCGAGAGCTATCACTG AAATTGGAGTAGAAGCGACC GAGGAAGATGAGATTTTT -7057 

SHP185 * 

HDVPEEQTL 469 

CATGATGTTCCTGAAGAACAAACGTTGgtaagtaaataaatcaacattgattgttacacaaactttaatatttttaaatttgaaaattttcttcaaagtg -6957 

EDLVDDVLVDTENSAISDPE 489 
ctcaaaaatcctgtcgaaaattacagGAAGATCTGGTGGATGATGTATTGGTTGATACTGAAAATTCAGCAATAAGTGATCCAGAAgtgagtagaaaacg -6857 

PKNVESESR 498 

tgcatgtattaattattaaaaaaaaaatatagttttccccagttttccttgacctaaaactcagcaatttcagCCTAAAAACGTGGAGTCAGMTCTCGT -6757 



fltip-i continued.,, 23/32 

S R F Q S A V D E I P P P S T S G C D G R L F D A L S S I I K A V G 532 
TCTCGATTTCAATCTGCTGTTGATGAGCTTCCACCTCCGTCGACTTCTGGATGTGATGGTCGACTTTTTGATGCACTTTCATCGATTATCAAAGCAGTTG -6657 



T D D N R I R P I T L E L A C L V I R Q I L N T V D D E R 561 

GMCAGATGACMTCGAATmCCAATTO -6551 
SHP175 * 

VHTSLTKLCFEVRLKLLS 519 

aattcaaaattgagcaaaatcagaatctaaatttcataaattgttcagGTACATACCAGTTTAACGAMmTGCnCGMGnCGTCTAAMCTTTTAT -6451 

SIGQYVNGENLFLEWFEDEYAEFE 603 

CATCAATTGGACAATATGTTAATGGAGAGAATCTGTTTTTGGAGTGGTTTGAGGATGAATATGCAGAATTTGAAgtaagccaagaggtccgaaaataatt -6351 

VNHVNFDIIGHEMLLPPAATPLSNLLL 630 

taattcatcctttttattcagGTGAATCACGTGMTTTCGATATAATCGGTCACGAAATGCTTCTTCCTCCAGCTGCAACTCCTCTTTCGAATCTGCTAC -6251 

HKRLPSGFEERIRT Q I V 641 

TTCATAAGCGATTGCCCAGTGGATTTGAAGAACGAATAAGAACTgtaggaaactttttaaatttgaaaattaattatatatatatttgcagCAMTCGTA -6151 

FYLHIRKLERDLTGEGDTELPVRVLNSDQEPVAI 681 
TTCTACCTACATATTCGAAAATTGGAACGAGATTTGACCGGTGAAGGAGACACAGAATTACCTGTGAGAGTGTTGAATTCTGATCAGGAACCAGTTGCCA -6051 



G D C I N L H NSDLLSCT 696 

TCGGTGATTGTATTAATTTACgtgagttcatctgcatagaaaacaccatatttctactcaaattaacaattttcagATAATTCGGATCTTCTATCCTGCA -5951 

VVPQQLCSLGKPGDRLARFLVTDRLQLILVEPD 129 

CTGT GGTTCCTCMCMCTATGTTC TCTTGGAAAACCTGGTGATCGTCTTGCTCGATTCCTTGTCACTGATAGACTTCAATTAATTCTTGTCGAACCGGA "5851 
* SHP176 

S R K A G W A I V R F V G L L Q D T T I N G D S T D S K V L H V V 162 

TTCTCGAMGCCGGATKGCMTTGTTO -5751 

siiii T 

V E G Q P S R I K K R H P V L T A 119 

GTGGAAGGGCAACCCTCGAGAATTAAGgtaagaatactaacgggaaaaaaaaatcaaaaaattacttctgtttcagAAAAGACATCCGGTTTTAACTGCA -5651 
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AFIFDDHIRCHAAKQRLTK 798 

AAGTTCATATTCGATGATCACATTCGGTGTATGGCAGCAAAGCAACGGCTCACCAAGgtaacggaaaaaataaccaaaaagacggaaagttattgtaaat -5557 

ggacgaaatcggcgaaattaattgaaaacgtttgaatttgccgctaaaaccaaacgaaaaccaaacgaaagcgaaatttaactatcccttcaggtagaat -5457 

G R Q T A R G L K L Q A I C S A L G V P R I D P A T 824 

atacattttatttctctttatagGGTCGCCAAACAGCACGTGGTCTGAAACTTCAGGCGATATGTTCAGCTCTTGGAGTTCCACGTATCGATCCAGCGAC -5357 

MTSSPRMNPFRIVKGCAPGSVRKTVSTSSSSSQ 857 

AATGACGTCATCACCACGAATGMTCCATTCAGM -5257 

GRPGHYSANLRSASRNAGMIPDDPTQPSSSSERR 891 

GGACGTCCCGGACATTATTCTGCAAATCnAG ATCAGCATCTAGAAATGCAGG AATGATACCAGATGATCCAACTCMCCGAGTAGTTCTTCGGAMGAA -5157 

SHP178 * 

S • 892 

GATCCtagggatcaatatctcttcagtttcatcattttatgctgtaaattgtatttaagtattcctattctttgtagtactgtatttacacatcgtctag -5057 



ttaaaatcacaaatctccgaaaaaacaaaccagtgaacatgtgatatttctcttgcccatagttctcttttttttttgaaacaaaaacaattacttttat -4957 

potyA 

gctcacctattcgagccatatttttttcccaattaccggttgtttattttaatttcttttttttttctgtaaatctactttatttttaaaactgcatttg -4857 

agattgtgtatattttttcaaaatggttcaaatgccgaatctatctactt -4807 



7 
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lop-1 

SI? 

^ MAEKAENLPSSSAEASE 1 

tttaatcattattcaaacagaaaaaccgattatttattcagattctcaaaaATGGCTGAAMAGCTGAAMTCTTCCATCTTCTTCGGCCGAAGC^^ -470 

EPSPQTGPNVNQKPSILVLGMAGSGKTTFVQ 4 

MGAGCCATCACCTCAAACTGGACCAAATGTGMTCAAAAACCATCGATTTTGGTTCTTGGAATGGCTGGTTCTGGAAAAACGACATTTGTTCAGgtaac -460 

RLTAFLHARKTPPYVINLDP 6 

tttcattcaattttgagagttttcaaacattactattttcagCGTCTCAM^ -450 

A V S K V P y P V N V D I R D T V K Y K E V H K E F G H G P N G A 10 

CGGCAGTTAGCAAftGTACCTTAT CCAGTGMTGTTGACATTCGA GATACTGTGAMTACAAGGMGTTATGAAAGAATTCGGAATGGGACCAAATGGAGC -440 

? SHP179 

IMTCLNLMCTRFDKVIELINKRSSDFSVCLLDT 13 

AATTATGACATG TCTTAACCTGATGTGTACTCG TTTTGATAMGTMTTGAGTTGATTMTAAGAGATCTTCTGATnCTCAGTTTGTCTTCTTGATACT -430 

SHP180 T 

P G Q I E A F T » S A S G S I I T D S L A S S H P T .16 

CCTGGACAMTTGAAGCATTCACTTGGAGTGCTAGTGGATCTATTATCACTGATTCATT GGCAAGTAGCCATCCCACGg taagggattttgatttatgaa -420 

T SHP143 " - 

atctgcttgaaatgaaaaaagattctaataaatttttgacttttaaacattttttacagttatatttggtctattttctatcattaaaagcaaaatgaaa -410 

VVMYIVDSARATNPTTFMSN 18 

agtcgattctactccatatttattaatttcgacttttcagGTGGTAATGTACATTGTGGATT CCGCTCGTGCCACAAATCCA ACTACATTCATGTCCAAT -400 

T SHP144 
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H L Y A C S I L Y R T K L P F I V "V F N K A D I V K P T F A L K W M 21 
ATGCTCTACGCATGTTCCATTCTCTACCGTACCAMCTTCCATTCATTGTCGTnTCAACAMGCTGATATTGTCAMCCAACATTTGCACTCAAATGGA -390 



QDFERFDEALEDARSSYMNDLSRSLSLVLDEFY 24 

TGCAAGATTTCGAAAGATTTG ATGMGCTnAGAGGATGCC AGMGCAGTTATATGMTGATTTGAGTCGTTCATTGAGTCTCGTTCTTGATGAATTCTA -380 

T 

SHP181 

C G L K T V CVSSATGEGFEDV 26 

TOGGACT(MACAGgtttttattcgaaataaaaccttm -370 

M T A I D E S V E A Y K K E Y V P M Y E K V L A E K K L L D E E E 29 

MTGACAGCMCGATGMGTGTTGAAGCATACAAAAAAGMTATGTTCCMTGTATGAAMAGTGn -360 

R K K R D E E TLKGKAVHDLNKV 31 

AGAAAGAAAAGAGATGAAGAGgtaattgtagtaatttaattctgattatcttcaaattttcagACTCTGAAAGGAAAAGCTGTTCACGACCTGAACAAAG -350 

ANPDEFLESELNSKIDRIHLGGVDEENEEDAEL 35 

TCGCCMTC CCGACGAATTTCTGGAGTCGG AGnGMTTCAAAMTCGATAGAATTCATTTGGGCGGAGTCGATGAAGAGAATGAGGAGGATGCTGAACT -340 

SHP182 T 

E R S • 35 

CGAAAGATCCtgattttctttttgtttttgaatttttattctattttgatccctgtttacttcttattgttctcattttgttgcgttgttttacatttta -330 

polyA 

ctcatttttgcataaacttgttgcaaaaataatataatttttgatctggaaatggttttaaaccttaacctttcatatattaataattttttttcaaaa -320 



aaacgttctaaaaaggttcctcattttttcaatataggaaattttgaaga 



-315 
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SL2 

"\ H S E K T F H K 8 

tcttttccaaaaatgaggttcttcgcttgaaaagccaacatttaaaacctttttttttccagaaacctagtggttaATGTCTGAAAAGACGTTCCACAAG -3051 

A Q T I R A K A S G V P S I V E A V Q F H G V R I T K N D A L V K E 42 

GCACAGACCATCCGTGCAAAGGCATCCGGAGTGCCTTCAATCGTCGAAGCTGTACAGTTTCATGGAGTTCGCATCACAAAAAACGATGCTTTGGTTAAGG -2951 

V S E L Y R 48 

AGgtactacccaaatttcaaaatgttgcacaattcaattgaaaatataaattgtgaattaaattcaacttacatgttttttcagGTTTCCGAATTATACA -285 

S K N L D E L V H N S B L A A R H L Q E V G L M D N A V A L I D T 81 

GMGTAAAMTCTAfflTGAACTTGTTO -215 

T SHP183 

SPSSNEGYVVNFLVREPKSFTAGVKAGVSTNGD 114 

ATCTCCAAOTCAAATGMGGATATGTTO -26 

A D V S L N A G K Q S V G G R G E A I N T Q Y T Y T V K 14 

GCGGATGTCAGHTAMTGCC(£AAM^ -255 

f SHP145 

gcactgccagtttggcatgttctcccaatattttttaattataaaatttggaagtataaaaaaatgtttgcttcatctaaaaatagcctttttcacatga -245 



aaaaaattgaaaaaaagtgctcaaaaatttcagaaatttccaatttccaaacaattttggagaactttcaaaaatttttccaactgaaattaaagctata -235 



147 
-225 



NISAIKPFLGWQKYSNVSATIYRSLAHMPWNQS 
C mWTCCGCmCAMCC ATTCCTGGGATO 

SHP138 T T SHP146 

DVDENAAVLAYNGQ L W N QKLLHQVKLNA 
GATGTTGAT GAGAATGCAGCTGTTCTTGCATATAATGGACAACTATGGAATCAAAAGCTTTTGCATCAAGTCAAATTGAATGCGqtaaaqtattataaqt 

I « R T L R A T R D A A F S V R E Q A G H T L 23 
gttttgtccaaactatgatacagttcttcagATATGGAGAACACTTCGTGCCACTCGAGATGCCGCATTTTCAGTTCGTGAACAAGCCGGACACACTTTG -195 

K F S L E N A V A V D T R 0 R P I L A S R G I L A 25 
AAATTCTCGTTGGAGAATGCTGTAGCTGTTGATACAAGAGATAGACCTATTCTTGCAAGTCGTGGAATTCTTGgtaagagtaacaacgactatttttaaa -185 



180 
-215 

208 
-205 



aaatatctttttcgaaaaaattacgaacgaaaaaaaactgtattatgtacccaaacgcgaaattttgcagttcttgcgcgttcttgttgataaaaaatat -115 

R F A Q 26 

gtaaaaaattggaaaaactacgaaaagtcgataaaaattccgtaccaaccggaaaatgtttcattaatttctcttccttttttcagCTCGTTTTGCTCAA -165 

EYAGVFGDASFVKNTLDLQ 279 

GAGTACG CAGGAGTATTTGGTGATGCGT CATTTGTGAAGAATACATTAGAmACAGgtaacaaccttatttcaacaattatttcaaattctattaaaaa -155 
SHP139 ? 

A A A P L P L G F I L A A S F Q A K H L K G L G D R E V H I L 31 

taattccagGCA OTGCCCCTCTTCCACTCGCT -H5 
SHP140 ^ 
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DRCYLGGQQDVRGFGLNTIG 330 
TGGATAGRTGTTATTTGGGT GGACAACAGGATGTTCGAGGATTTGGTCTGAATACTATTGGAqtgagttttaacgaaattctcttgaaagtcaaataatc -1357 
T SHP184 

VKADNSCLGGGASLAGVVHLYRPLIPPNMLF 361 
attttcagGTTAAAGCAGATAACAGTTGTCTTGGAGGAGGTGCTTCACTTGCTGGTGTCGTTCATTTGTATCGGCCATTGATTCCACCAAATATGCTATT -1257 

A H A F L A S G S V A S V H S K N L V Q Q L Q D T Q R V S A G F G 394 
TGCACACGCATTCCTTGCATCTGGAAGTGTTGCATCAGTTCATTCCAAAAATTTGGTGCAACAATTACAGGATACTCAACGAGTATCAGCCGGATTTGgt -1157 

SHP163 T 

gagtttgaaatttaggaaacatttggatgaaatgtattttttaaaaatagatcagctttatttatttgaaaaaaaacgctcattaatcaatagtgatagt -1057 

tccattctgagtttcttcttcttcctcgcggaatacaatttttgacttgttcgcatccttcttgtgtactttgtcaccaatcttctcatcaactaaatct -957 

cgaaactgaaaaaatttcaaaattattccaaaaaatattgatgcagactacctttttgatggcttctggtacgtttctagcgtcgaatggattggctcct -857 



ccaataattaaagtctcgttcggtagtttagccagacggacggtgtgcttcaacatttttctaattaatctatttcaattcaagtcactcactctctctt -757 
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gacgtcttcttctatattccaagaactctgcagaaaatccgtgtccgccttgtgtgtttctagttggcgtcggaggattcacgggtccaagacgaatgga -657 

tgtctaaaaaatgttatatttttgcataaagaaaacaccataccttcaccactttttgagttgtgggcgttctgaatggaattgatcgattattafctgct -557 

ctttcttgatttgcttctatcagctgcgtaatgaggtgttctaaagatcagctttaattcatttggacaagtgctcctctaataaacttaccctgtactc -457 

atttttgaaacgatttacgatgataagattgaaagtggaagttaaatttagtctttcaaagttgaaataaaatcttcataaataaataaatttaaatgaa -357 

L A F V F K S 401 

agattaaataaattaacgttcacgtagttaaaaaaataatttaaatcttaaacttctaataaaaaatctcaattttccagGACTCGCATTCGTGTTCAAAA -257 

IFRLELNYTYPLKYVIGDSLLGGFHIGAGVNFL 434 
GTATTTTCCGGCTGGAACTCAACTACACGTATCCATTGAAATATGTGCTCGGCGATTCATTGCTCGGTGGATTCCATATTGGAGCTGGTGTCAACTTCTT -157 

1 

Gtagaga ttaattggatgcaagcacccct caaaaagatttttttgaaaaacgataaattcacagaatttcagttctttttctcccccttttattgttatt -57 
SHP134 ? 

ttcatcgtaa tgctgtgctagaagtcagag taaatatgagtttttttgtgttctaggaattccattttttcaggaagcaaatttaataaaaattatcgaa 44 
SHP164 T 

polyA 

r 

tttcttgctctaaagatgttgtacattttat ggaaatgttcgtatagtaa 94 

T SHP135 



kp-l 
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SL2 

-\ MSLRKINFVTG 11 
ttcgaacactttatatttctcgttttaaaactgtcggtqttttatagtaaactatcttcagaaaaaaATGAGCCTACGA AAAATCAATTTCGTAACTGGR 194 

SHP91 T SHP118 * 



NVKKLEEVKAILKNFE 27 
MCGTGAAGAAGCTTGAAGAAGTCAAGGCTATTTTGAAGAATTTCGAGgtaaaatatatttgatattattcgaacgcgaaattttgcgccaaaagtacga 294 



tgcctggtctcaacacgacaatattttgttaaatacaaacgaatgtgcgccttcaaagaaaagtttcaatctttcgttgccgtggagatatttttagagt 394 



V S N V D V D L D E F 38 

ttttgtttaaattatatatttgtcgtatcgaaaccgggtaccgtaatcaatcaattaaatattttcagGTTTCAAACGTGGATGTCGATTTGGATGMTT 494 

T SHP165 

QGEPEFIAERKCREAVEAVKGPVL 62 

CCAAGGAGAACCCGMTTTATTGCCGAAAGAAAGTGCCGTGAGGCTGTTGAAGCTGTAAAAGGGCCCGTTTTGgtatggaaaattgtatttgttctaaaa 594 



VEDTSLCFNAMGGLPGPYIKWFLKNLKPE 91 
attgtcaaatttcagGTCGAAGACACAAGTTTATGCTTCAACGCAATGGGCGGTCTTCCTGGACCTTATATCAAGTGGTTTTTGAAGAATTTGAAACCAG 694 

T 

SHP129 



# 
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G L H N M L A GFSDKTAYAQCIF 111 

AAGGACTACATAATATGCTAGgtaaatattttaattttttgaaaaaacttatttttcagCCGGATTTTCTGACAAAACCGCCTATGCTCAATGCATCTTT 794 



AYTEGLGKPIHVFAG 126 
GCGTACACTGMGGACTCGGAAAACCTATTCATGTAmGCTGqtatgattttttgaatttaattctttaattttatatgttaatttagttgttt 894 



KCPGQIVAPRGDTAFGWDP 145 
ctcaatttatqagaqatttttttttcaatttttctatttcaqGAMATGTCCTGGTCAAATTGTTGCTCCACGT GGTGATACTGCTTTTGGATGG GATCC 994 

Y SHP130 



CFQPDGFKETFGEMDKDVKNEISHRAKALELLK 178 
ATGCTTCCAGCCAGATGOTTTAAAGAMCATTCGGAGMTGGATAMGATGTAAAMTGMTTTCTC 1094 

T SHP119 SHP120 T 



E Y F Q N N • 184 

GMTATTTTCAGAATAATtaaattattttttctcatctatgcaatttcttgaaaatttgttaagtttccgttgttatgcatttgcttttatttaaaaaaa 1194 



polyA 

r 

aaagaatatttttacattaatattagatatgagaaaagagtaatttctggattttaaccttcctacaaaagaatatttatattttttgtatgatttttta 1294 

f 

SHP93 



